OverviewOur team builds the Microsoft Inference Cloud, a scalable and reliable service that delivers SaaS inferencing with large language models (LLMs) on multi billion dollar GPU capacity. Inference cloud is the foundational service for all Microsoft Copilots, Azure OpenAI, and ISV hosted models like Llama, Mistral, Cohere, and others, providing a unified and consistent e